Diversity of rationality affects cooperation in spatial prisoner's dilemma game 
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In real world, individual rationality varies for the sake of the diversity of people's individuality. 
In order to investigate how diversity of agent's rationality affects the evolution of cooperation, 
we introduce the individual rationality proportional to the /3th power of the each agent's degree. 
Simulation results on heterogeneous scale-free network show that the dynamic process is greatly 
affected by the diversity of rationality. Both promotion and inhibition of cooperative behavior 
can be observed at different region of parameter l3. We present explanation to these results by 
quantitative and qualitative analysis. The nodes with middle degree value are found to play a 
critical role in the evolutionary processes. The inspiration from our work may provide us a deeper 
comprehension towards some social phenomenon. 

PACS numbers: 



I. INTRODUCTION 

In evolutionary biology, behavioral sciences, and more 
recently in economics, understanding conditions for the 
emergence and maintenance of cooperative behavior 
among selfish individuals becomes a central issue [l], Q ■ 
Including cooperation and defection as the two compet- 
ing strategies, the prisoner's dilemma game (PDG) is re- 
garded as a paradigm for studying this issue Q-p- 

Based on a structured population [3]- [HI, consider- 
able efforts have been extended by allowing the players 
to voluntary participating (l^ . or introducing dyiiamic 
network model fis', u3| , dynamic payoff matrices , dy- 
namic preferential selection ^16j], and difference between 
interaction and learning neighborhoods Santos and 
Pacheco have studied the PDG on heterogeneous 
scale-free networks, and observed that, when the under- 
lying network is scale-free, cooperation can be greatly 
enhanced and becomes the dominating trait throughout 
the entire range of parameters of t he g ame, due to the 
cooperators' cluster forming nature [l9|. 

In the mentioned works, individual particularity is 
not the main topic. However, particularity is ubiqui- 
tous among the individuals of social groups and animal 
species. Thus, the diversity of individuality inevitably 
appears between the players engaging in the evolution- 
ary games. Instead of taking individual difference into 
account directly, some works concentrate on the individ- 
ual similarity |20|-(2^. Recently, it has been directly 
proved that diversity of certain individual property can 
efficiently promote cooperative behavior in evolutionary 
games [H, [23l- The authors introduce social member's 
extrinsically determined properties, like wealth or social 
status, to increase or decrease the fitness of a player de- 
pending on its location on the spatial grid. Different 
scaling factors are provided to different nodes, rescaling 
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their payoff matrix in PDG [23| . 

Different from the diversity of extrinsically determined 
properties [H, this paper concentrates on the di- 
versity of the intrinsic property, individual rationality. 
Szabo's stochastic evolutionary rule [25], especially the 
Fermi upgrading rule, has taken this vital and intrinsi- 
cally determined property into account. In the formula of 
Fermi rule, the variable temperature indicates how ratio- 
nal the individual is, when making decision in the game. 
Just as temperature in statistical physics, this very vari- 
able, in former works [2^-[2^ was viewed mainly as a 
stochastic noise. Phenomena like stochastic resonance 
[13] and second-order phase transition [2^ are discov- 
ered. These works have considered the individual ratio- 
nality to be at the same value for every player in the 
game. Therefore, this variable actually serves as a re- 
flection of collective rationality belonged to the whole 
system. However, in real society, individual rationality 
depends on its intelligence, disposition, motivation, and 
circumstance, which differ from individual to individual. 
Serving as an intrinsic factor, different level of rationality 
determines different choice and correspondingly fosters 
different game result. Within our study, we regard the 
degree of a node as a rank of the individual's certain so- 
cial feature, for instance, social status, and directly relate 
this feature to the individual rationality. An agent's ra- 
tionality is set to be proportional to the /3th power of the 
agent's degree [s^]- In this way, the diversity of rational- 
ity is associated with the diversity of degree. Networks 
with a heterogeneous topological structure are used in 
our work. We reported below that cooperation is en- 
hanced in a certain region of parameter /3, but inhibited 
in other region. 

The model and simulation result are presented in sec- 
tion II. To explore the mechanism for both promotion and 
inhibition of cooperation, we present a statistical analy- 
sis to the dynamic process in terms of microscopic argu- 
ments in section HI. Final conclusion is to be drawn in 
section IV, as well as some sociological inspirations from 
our findings. 
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II. THE MODEL AND SIMULATION RESULT 

To introduce the diversity of the intrinsic property, ra- 
tionaUty, we consider an evolutionary two-strategy pris- 
oner's dilemma game with players located on vertices of 
a heterogeneous network. Each individual is allowed to 
interact with its nearest neighbors, and self-interactions 
are excluded. Players can adopt one of the two simplest 
strategies: "cooperate" (C) and "defect" (D). The strat- 
egy adoption mechanism is based on the rescaled version 
of the payoff matrix introduced by Nowak [3l| : 



C 
D 



C D 
10 
b 



(1 < 6 < 2) 



(1) 



During the evolutionary process, a player located on node 
i can follow the strategy of one of its randomly chosen 
neighbor at node j, with the probability depending on 
the payoff difference {Mi — Mj), 



1 



l + exp[iM^- Mj)/T^] 



(2) 



This is the Fermi updating rule [25|, where Ti charac- 
terize the level of rationality pertained to node i. And 
Tj = denotes complete rationality, where the individ- 
ual always adopts the best strategy determinately; while 
Ti > 0, it introduces some irrational factor, that there 
is small possibility to select the worse one; Ti —>■ oo de- 
notes that the individual is completely irrational, and its 
decision is random. Within this study we consider the 
diversity of rationality defined by the following function 



T, = NT 



(3) 



where N is the total number of nodes in the network, and 
ki is the degree of node i. Here we adopt the Barabasi- 
Albert scale- free network [sl, [s^ . To denotes the average 
value of rationality. We use parameter (3 to tune the rela- 
tionship between node i's degree h and its rationality Ti. 
While /3 < 0, nodes with larger degree gain lower value 
of rationality. While (3 = 0, rationality is uniformly dis- 
tributed. P > denotes a reversed situation as compared 
with the case /3 < 0: nodes with larger degree gain larger 
value of rationality. If /? = 1 , rationality is distributed by 
power-law. It is worth mentioning that significantly high 
value of Ti could induce substantially random behavior 
of an agent, though payoff difference may be large. On 
the contrary, very low Ti would heavily enhance agent's 
sensitivity towards higher payoff. 

The case of /3 = is discussed in [TI, [l^ . Although the 
updating mechanism we adopt is different from [isl . [l9| , 
when T is low, at a microscopic scale the following fact 
still exist: cooperators tend to occupy the hubs, since 
hubs are directly connected, if a defector occasionally 
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FIG. 1: The frequencies of cooperators pc versus the tempta- 
tion to defect b at different average value of rationality To = 1, 
12, and 60, and /3 = —1, 0, and 1. 



takes over one hub, the probability that it gets reoccu- 
pied by a cooperator becomes essentially one. However, 
while using Fermi updating rule, this fact may be affected 
by high rationality value. Fig[T]is plotted to explore the 
influence of rationality in both cases, f3 = and (3^0. 
The BA network is built with the initial number of nodes: 
Too = 2, the number of edges linked to the exiting nodes 
from the newly added node in each time step: m = 2, 
and the average degree fc = 4. The total number of nodes 
is iV = 4225. Before the start of each game simulation, 
both strategies populate the spatial grid uniformly. We 
adopted a synchronous updating scheme. All the simu- 
lation results were obtained by averaging over 1000 gen- 
erations after a transient time of 5000 generations. Each 
data is obtained by averaging over ten different network 
realizations with ten runs for each realization. 

In FiglU in the case of /3 = 0, when To = 1, cooper- 
ation is dominating over the entire range of the temp- 
tation to defect b, but sharp decrease of the frequencies 
of cooperators pc can be measured when Tq becomes 
large. This indicates that the cooperation promoted by 
hubs is indeed sensitive to the value of rationality. At 
the same time, in the case of P = —1, the value of pc 
is not evidently affected by intense variation of Tg, and 
there always exists a broad range in the parameter space 
within which cooperation rule completely. Thus, there 
must be other factors contributing to the facilitation of 
cooperation. 
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FIG. 2: The frequencies of cooperators c versus /3. Left; 
The temptation to defect b = 1.04, 1.08, 1.6, and 1.75. The 
average value of rationality To = 1. Middle: The temptation 
to defect b = 1.4. The average value of rationality To — 0.1, 
0.2, 0.3, 0.4, and 0.5. Right: The temptation to defect b = 1.8. 
The average value of rationality To = 0.1, 0.2, 0.3, and 0.4. 



To further investigate how cooperation is influenced by 
the parameter /?, the variation of pc versus /? is demon- 
strated in Figl^l It is observed that pc and (3 have appar- 
ently non-monotonous relationship. The shapes of the 
curves are similar to a gorge located in a plateau. This 
shape indicates that diversity of rationality can either 
promote or inhibit cooperation, depending on the value of 
p. To highlight the sharp contrast between the effective 
promotion and serious inhibition, in this paper the gorge 
is called cooperation crisis. The two downside graphes of 
FiglH show the cases of relatively small values of average 
rationality, where the similar phenomenon is observed. 
Higher value of Tq or b only results in larger width of 
the gorge, leaving the shape of the curves unchanged. 
These results reveal that the evolutionary dynamics is 
particularly sensitive to rationality distribution. In or- 
der to explain the main features of the reported results, 
especially the cooperation crisis, we hereafter scrutinize 
in depth the microscopic evolution of cooperation. 



III. ANALYSIS AND DISCUSSION 

When P — 0, in [H, [T^, the prevalence of cooperation 
is because that on the heterogeneous network topology, 
hubs can stick together the cooperator cycles that would 
otherwise be disconnected, and form stable cooperative 
clusters (C cluster) [l^. In [l^, a cooperative cluster, 
namely a cooperator core, is a connected component fully 
and permanently occupied by pure cooperators. While 
invaded by defectors, the local structure of a C cluster can 
be viewed as a C strategy hub surrounded by a number 
of periphery neighbors, most of which are cooperators. 
Then whether C clusters are stable or not when (3^0 
is to be analyzed. Two opposite effects generated by 
four crucial dynamic processes determine the fluctuation 
number of cooperators and defectors in a C cluster in 
every next time step, and thus determine the stability of 
the C cluster: 

EfTectl: Corruption of C cluster 

Process (A): the hub node of a C cluster adopts 
the strategy of a periphery defector, and then 
transits to D strategy; 

Process (D): a periphery cooperator adopts the 
strategy of the hub defector, and then transits 
to D strategy. 

Efrect2: Consolidation of C cluster 

Process (B): a hub defector adopts the strategy 
of a periphery cooperator, and then transits 
to C strategy; 

Process (C): a periphery defector adopts the 
strategy of the hub node of a C cluster, and 
then transits to C strategy. 

The four processes are corresponding to four kinds of 
strategy transition probability according to the Fermi up- 
dating rule. Based on mean-field approximation, imaging 
a localized block in the network, a node is surrounded by 
k neighbors among which the cooperators have a propor- 
tion of /i while the defectors have the rest fraction (1 — /x). 
The payoff difference between a cooperator and a defec- 
tor can be denoted by pL(ki — kjb) or fi{kib — kj). Because 
the mean-field hypothesis is not always fit for the evo- 
lutionary games on networks, the following analysis can 
only be qualitative. From equation ([2]) and ([3]), we gain 
the four kinds of transition probability: 
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FIG. 3: The probability of high degree nodes' strategies tran- "-'^ 
siting to the strategies of middle (left) and small (right) de- " S'' 
gree nodes. kL = 172, /cm = 30, and ks ~ 2. The black lines 
denote the probability of process (A), and the red lines de- 
note the probability of process (B). The temptation to defect 
b = 1.4, and the average value of rationality To = 1. 
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The upper scripts H and P denote hub and periph- 
ery respectively. Through these two formulas, the term 
{k^ /k^) remodifies and extends the Fermi rule. 

The strategy transition of a hub or periphery node is 
determined by the four processes. For a hub node, both 
process (A) and (B) could happen; for a periphery node, 
both process (C) and (D) could happen. If occurrence 
rates of process (A) are higher than of (B), and of (D) 
are higher than of (C), separately, then C clusters are 
unstable, and the whole system will asymptotically be 
meshed in a absorbing state of D. We can regard the 
strategy transition probability in a certain process as the 
occurrence rate of this process. 

To calculate the four kinds of transition probability 
of nodes with different degree value, we approximately 
classify the nodes in the following way: (l)Nodes with 
small value of degree: m < ks < k; (2)Nodes with mid- 
dle value of degree: k < km < k'; (3)Nodes with large 
value of degree: k' < k^ < k^ax- Figl3] shows the strat- 
egy transition probability of the high degree nodes(the 
upper scripts L, M, and S respectively denote nodes 
with large, middle, and small value of degree). Sharp 
increase can be observed from the region of cooperation 
crisis (see Figl2]), and the probability of process (A) is 
slightly higher than process (B), but they both become 
equivalent to 0.5 when (3 gets larger. However, in FigHl 
curves concerning the strategy transition probability of 
the middle degree nodes present a symmetrical fashion. 
More importantly, large variation of W only exists in the 
region of crisis, where process (A) always obtains larger 
occurrence rate than process (B), and process (D) always 
obtains larger occurrence rate than process (C). These re- 
sults indicate the advantage of defectors, especially while 
middle degree nodes participating in the game. In our 



FIG. 4: The probability of middle degree nodes' strategies 
transiting to the strategies of high (a) , small (b) , and middle 
(c and d) degree nodes. In graph (a) and (c), black lines 
denote the probability of process (A) , and red lines denote the 
probability of process (B); in graph (b) and (d), black lines 
denote the probability of process (D), and red lines denote the 
probability of process (C). kL = 172, Ami = 12, kM2 = 30, 
and ks ~ 2. The temptation to defect 6 — 1.4, and the 
average value of rationality To = 1. 



calculation, we build a BA network with the largest de- 
gree kmax = 172 and the smallest degree ks = m — 2. 
For simplicity, degrees for the three classes of nodes are 
confined to isolated values. For example, here we set 
k' = 50; high degree value: k^i — 172 and kL2 = 100; 
middle degree value: kMi = 12 and kM2 — 30; low degree 
value: ks = 2; and fi — 0.75. Small value change brings 
no impact on the qualitative results. 

To explore the roots of the agents' diverse behavior, 
Fig|5] is plotted to examine the rationality variation ver- 
sus (3 for the three classes of nodes. The rationality 
of the nodes with smallest and largest degree displays 
a monotonous decrease and increase respectively, while 
that of nodes with other degree value varies in a non- 
monotonous fashion. Mathematical explanation to the 
numerical results is not complicated. It is crucial to note 
that the peak values of Tm are all around the region 
where the cooperation crisis takes place. These peaks 
nicely explain the large variation in FigU) Furthermore, 
the monotonous increase of rationality of the nodes with 
the the largest degree results in the asymmetrical fash- 
ion in Fig[31 as well as in FigO Indeed, rationality value 
change contributes to agents' behavior change. 
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FIG. 5: The rationality of different degree nodes Ts, Tm, and 
Tl versus /3. ks = 2, 3, and 4. = 8, 20, and 30. ks = 100, 
126, and 172. The average value of rationality To = 1.0. 

In the region where the rationaUty of middle degree 
nodes reach their peak, a node with relatively larger de- 
gree becomes rather irrational, and thus gains a much 
higher probability to adopt the strategy of a node with 
relatively lower payoff and lower degree. Simultaneously, 
nodes with relatively lower degree, for their irrationality, 
are not inclined to imitate their larger degree and higher 
payoff neighbors. Consequently, this mechanism deteri- 
orates the validity of the cooperation-facilitating mech- 
anism reported in [H, [l^. As a result of the predom- 
inance of process (A) and (D) against (B) and (C), as 
demonstrated in FigdJ defectors, though initially may 
be the minority in a C cluster, do not only obtain a great 
chance to survive, but also propagate fast and asymp- 
totically dominate the whole network. This is why the 
C clusters fail to maintain their stability and why co- 
operation crisis occurs. The peak rationality values of 
middle degree nodes play a key role. Outside this region, 
cooperative behavior is promoted. 

On the left side of crisis, especially when /3 < 0, 
large and middle degree nodes are very rational. For 
high and middle-ranking defectors, when severely weak- 
ened by the low-ranking neighbors who follow their defec- 
tive strategies, low rationality value will result in much 
greater sensitivity towards payoff, then a little higher 
payoff of a neighbor is enough attractive for them to im- 
itate this neighbor, even if the degree of which is much 
lower. Thereby, they gain much greater chance to tran- 
sit to C strategy than in the case of /3 = 0, in which 
low-ranking players could hardly influence the high and 



middle-ranking ones. Clearly, when /3 < 0, the effi- 
ciency of cooperation promotion is largely enhanced, even 
though To is significantly large. 

On the right side of crisis, the decision of a node with 
high degree becomes random, and its strategy transition 
probability only depends on the proportion between its 
neighboring cooperators and defectors. This irrational 
hub is surrounded by large number of rational nodes with 
middle and small degree, and these nodes can quickly 
form a obedient domain around it and leave it with few 
defective neighbors. On the other hand, highly rational 
neighbors of a hub defector would first adopt the hub's 
strategy, simultaneously resulting in a sudden drop of the 
hub's payoff, then abandon this strategy, for the sake that 
the low payoff hub hardly affects them and their cooper- 
ator neighbors with a little higher payoff could overturn 
their D strategies. After that, the hub defectors will gain 
much more cooperative neighbors, and thus much greater 
probability to transit into a cooperator. These facts ex- 
ist at high To, too. Notably, middle degree nodes are 
rational outside crisis, but irrational in crisis. 

Based on the above discussion, as parameter (3 varies 
from negative value to a large positive value, the system 
experiences successive sorts of dynamic processes, corre- 
sponding to the different parts and different shapes of 
the curves (pc versus /3, show as Figl^]). When /3 is small 
or negative, pc is on the plateau. For nodes with large 
and middle degree, process (A), (B), (C), and (D) have 
approximately the same occurrence rate, and the whole 
system is globally dominated by cooperation. When (3 
gets larger, process (A) and (D) begin to show consider- 
able predominance over process (C) and (B), and thus, 
the stability of C clusters is severely disturbed. When j3 
is further increased, process (A) and (D) becomes over- 
whelming, and C clusters are totally destroyed, pc de- 
cays abruptly into the valley, and the crisis comes. When 
[3 continues to increase, the predominance of process (A) 
and (D) starts to decline, and C clusters begin to res- 
urrect. Finally, cooperative behavior holds global preva- 
lence, and the probability of the four process return to a 
similar value, pc arrives at another plateau. 



IV. CONCLUSION 

To sum up, in this paper, we introduce a set of ratio- 
nality distributions to investigate the effect of rationality 
diversity on evolutionary prisoner's dilemma game on BA 
Scale-Free network. Our model remodifies the Fermi up- 
dating rule, and our results largely extend the results in 
[3, Our work reveals that diversity of individual 

rationality heavily influences the evolutionary process. 
Two routes, produced through two sorts of rationality 
distributions (on the two sides of crisis), promote co- 
operation, even while the average value of rationality is 
high. On the contrary, severe deterioration of coopera- 
tion, namely the cooperation crisis, also appears in an- 
other sort of rationality distribution. By analyzing the 



6 



stability of C clusters, causation of the routes of cooper- 
ation promotion and deterioration is interpreted. 

The crucial contribution made by nodes with middle 
degree value may provide some sociological inspiration. 
Degree is often viewed as a certain rank of game players. 
Perhaps we could analogize it to some social rank of in- 
dividuals, then middle degree nodes might be related to 
the middle class, which is neither the most powerful class 
nor the most populous class. Middle class could serve as 
social stabilizer, pointed out by Samuel P. Huntington 
js^ . However, middle class could also display subversive 
function, argued by Huntington's opponents. Individ- 
ual rationality could be affected by political or econom- 
ical factors, and are not unchangeable. Probably, since 
the organization of society largely depends on the emer- 
gence of cooperation d^-jl^l, such two contrary func- 



tions could be relevant to two rationality level of middle 
class in two sorts of rationality distribution. Further in- 
vestigation on the diversity of rationality might yield new 
insights towards complex social phenomenon. 
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